174 research outputs found

    Plasmonic Tamm states: second enhancement of light inside the plasmonic waveguide

    Full text link
    A type of Tamm states inside metal-insulator-metal (MIM) waveguides is proposed. An impedance based transfer matrix method is adopted to study and optimize it. With the participation of the plasmonic Tamm states, fields could be enhanced twice: the ffirst is due to the coupling between a normal waveguide and a nanoscaled plasmonic waveguide and the second is due to the strong localization and field enhancement of Tamm states. As shown in our 2D coupling configuration, |E|^2 is enhanced up to 1050 times when 1550 nm light is coupled from an 300 nm Si slab waveguide into an 40 nm MIM waveguide.Comment: 3 pages, 4 figure

    Complementary Labels Learning with Augmented Classes

    Full text link
    Complementary Labels Learning (CLL) arises in many real-world tasks such as private questions classification and online learning, which aims to alleviate the annotation cost compared with standard supervised learning. Unfortunately, most previous CLL algorithms were in a stable environment rather than an open and dynamic scenarios, where data collected from unseen augmented classes in the training process might emerge in the testing phase. In this paper, we propose a novel problem setting called Complementary Labels Learning with Augmented Classes (CLLAC), which brings the challenge that classifiers trained by complementary labels should not only be able to classify the instances from observed classes accurately, but also recognize the instance from the Augmented Classes in the testing phase. Specifically, by using unlabeled data, we propose an unbiased estimator of classification risk for CLLAC, which is guaranteed to be provably consistent. Moreover, we provide generalization error bound for proposed method which shows that the optimal parametric convergence rate is achieved for estimation error. Finally, the experimental results on several benchmark datasets verify the effectiveness of the proposed method

    Hysteretic Behavior Simulation Based on Pyramid Neural Network:Principle, Network Architecture, Case Study and Explanation

    Full text link
    An accurate and efficient simulation of the hysteretic behavior of materials and components is essential for structural analysis. The surrogate model based on neural networks shows significant potential in balancing efficiency and accuracy. However, its serial information flow and prediction based on single-level features adversely affect the network performance. Therefore, a weighted stacked pyramid neural network architecture is proposed herein. This network establishes a pyramid architecture by introducing multi-level shortcuts to integrate features directly in the output module. In addition, a weighted stacked strategy is proposed to enhance the conventional feature fusion method. Subsequently, the redesigned architectures are compared with other commonly used network architectures. Results show that the redesigned architectures outperform the alternatives in 87.5% of cases. Meanwhile, the long and short-term memory abilities of different basic network architectures are analyzed through a specially designed experiment, which could provide valuable suggestions for network selection.Comment: 41 pages, 14 figure

    Class-Imbalanced Complementary-Label Learning via Weighted Loss

    Full text link
    Complementary-label learning (CLL) is widely used in weakly supervised classification, but it faces a significant challenge in real-world datasets when confronted with class-imbalanced training samples. In such scenarios, the number of samples in one class is considerably lower than in other classes, which consequently leads to a decline in the accuracy of predictions. Unfortunately, existing CLL approaches have not investigate this problem. To alleviate this challenge, we propose a novel problem setting that enables learning from class-imbalanced complementary labels for multi-class classification. To tackle this problem, we propose a novel CLL approach called Weighted Complementary-Label Learning (WCLL). The proposed method models a weighted empirical risk minimization loss by utilizing the class-imbalanced complementary labels, which is also applicable to multi-class imbalanced training samples. Furthermore, we derive an estimation error bound to provide theoretical assurance. To evaluate our approach, we conduct extensive experiments on several widely-used benchmark datasets and a real-world dataset, and compare our method with existing state-of-the-art methods. The proposed approach shows significant improvement in these datasets, even in the case of multiple class-imbalanced scenarios. Notably, the proposed method not only utilizes complementary labels to train a classifier but also solves the problem of class imbalance.Comment: 9 pages, 9 figures, 3 table

    Complementary label learning based on knowledge distillation

    Get PDF
    Complementary label learning (CLL) is a type of weakly supervised learning method that utilizes the category of samples that do not belong to a certain class to learn their true category. However, current CLL methods mainly rely on rewriting classification losses without fully leveraging the supervisory information in complementary labels. Therefore, enhancing the supervised information in complementary labels is a promising approach to improve the performance of CLL. In this paper, we propose a novel framework called Complementary Label Enhancement based on Knowledge Distillation (KDCL) to address the lack of attention given to complementary labels. KDCL consists of two deep neural networks: a teacher model and a student model. The teacher model focuses on softening complementary labels to enrich the supervision information in them, while the student model learns from the complementary labels that have been softened by the teacher model. Both the teacher and student models are trained on the dataset that contains only complementary labels. To evaluate the effectiveness of KDCL, we conducted experiments on four datasets, namely MNIST, F-MNIST, K-MNIST and CIFAR-10, using two sets of teacher-student models (Lenet-5+MLP and DenseNet-121+ResNet-18) and three CLL algorithms (PC, FWD and SCL-NL). Our experimental results demonstrate that models optimized by KDCL outperform those trained only with complementary labels in terms of accuracy

    A comprehensive review of graph convolutional networks: approaches and applications

    Get PDF
    Convolutional neural networks (CNNs) utilize local translation invariance in the Euclidean domain and have remarkable achievements in computer vision tasks. However, there are many data types with non-Euclidean structures, such as social networks, chemical molecules, knowledge graphs, etc., which are crucial to real-world applications. The graph convolutional neural network (GCN), as a derivative of CNNs for non-Euclidean data, was established for non-Euclidean graph data. In this paper, we mainly survey the progress of GCNs and introduce in detail several basic models based on GCNs. First, we review the challenges in building GCNs, including large-scale graph data, directed graphs and multi-scale graph tasks. Also, we briefly discuss some applications of GCNs, including computer vision, transportation networks and other fields. Furthermore, we point out some open issues and highlight some future research trends for GCNs

    Seismic loss assessment for buildings with multiple LOD BIM data

    Get PDF
    Earthquake-induced economic loss of buildings is a fundamental concern for earthquake-resilient cities. The FEMA P-58 method is a state-of-the-art seismic loss assessment method for buildings. Nevertheless, because the FEMA P-58 method is a refined component-level loss assessment method, it requires highly detailed data as the input. Consequently, the knowledge of building details will affect the seismic loss assessment. In this study, a seismic loss assessment approach for buildings combining building information modeling (BIM) with the FEMA P-58 method is proposed. The detailed building data are automatically obtained from the building information model in which the building components may have different levels of development (LODs). The determination of component type and the development of the component vulnerability function when the information is incomplete are proposed. Finally, to demonstrate the rationality of the proposed method, an office building that is available online is selected, and the seismic loss assessments with multi-LOD BIM data are performed as case studies. The results show that, on the one hand, even if the available building information is limited, the proposed method can still produce an acceptable loss assessmenton the other hand, given more information, the accuracy of the assessment can be improved and the uncertainty can be reduced using the proposed method.The study is financial supported by the National Natural Science Foundation of China (No. 51578320)
    • ā€¦
    corecore